Scene image representation by foreground, background and hybrid features
نویسندگان
چکیده
Previous methods for representing scene images based on deep learning primarily consider either the foreground or background information as discriminating clues classification task. However, also require additional (hybrid) to cope with inter-class similarity and intra-class variation problems. In this paper, we propose use hybrid features in addition represent images. We suppose that these three types of could jointly help image more accurately. To end, adopt VGG-16 architectures pre-trained ImageNet, Places, Hybrid (both ImageNet Places) datasets corresponding extraction foreground, information. All are further aggregated achieve our final representation Extensive experiments two large benchmark (MIT-67 SUN-397) show method produces state-of-the-art performance.
منابع مشابه
Object Category Recognition by Bag-of-Features Using Co-Occurrence Representation by Foreground and Background Information
متن کامل
Thesis Proposal: Learning Image Patch Representation for Detection, Recognition and Dynamic Foreground/Background Extraction
Using feature based (ie. interest image features that satisfy some metrics of geometric or photometric invariance [39, 50, 66, 74, 73]) or direct image methods (ie. derivatives or differences of image intensity patterns [2, 37, 49]) for 3D object/scene model construction [48, 97], image content retrieval [85, 14], object recognition [24, 66], video structure matching/parsing [83, 93, 92] and au...
متن کاملDisentangling Motion, Foreground and Background Features in Videos
This paper introduces an unsupervised framework to extract semantically rich features for video representation. Inspired by how the human visual system groups objects based on motion cues, we propose a deep convolutional neural network that disentangles motion, foreground and background information. The proposed architecture consists of a 3D convolutional feature encoder for blocks of 16 frames...
متن کاملEnhancement of Learning Based Image Matting Method with Different Background/Foreground Weights
The problem of accurate foreground estimation in images is called Image Matting. In image matting methods, a map is used as learning data, which is produced by those pixels that are definitely foreground, definitely background ,and unknown. This three-level pixel map is often referred to as a trimap, which is produced manually in alpha matte datasets. The true class of unknown pixels will be es...
متن کاملInteraction between Attention and Bottom-Up Saliency Mediates the Representation of Foreground and Background in an Auditory Scene
The mechanism by which a complex auditory scene is parsed into coherent objects depends on poorly understood interactions between task-driven and stimulus-driven attentional processes. We illuminate these interactions in a simultaneous behavioral-neurophysiological study in which we manipulate participants' attention to different features of an auditory scene (with a regular target embedded in ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Expert Systems With Applications
سال: 2021
ISSN: ['1873-6793', '0957-4174']
DOI: https://doi.org/10.1016/j.eswa.2021.115285